SPECIFICATION

TITLE OF THE INVENTION 

VEHICLE MOUNTED CONTROL APPARATUS 


BACKGROUND OF THE INVENTION 

Field of the Invention 

The present invention relates to a vehicle mounted control apparatus for performing control in accordance with a voice command.

Description of the Related Art 

There has conventionally been provided a car navigation apparatus that outputs a conversational operating instruction by means of a screen or a voice to assist the key entry of a command by a user (for example, patent document 1).

Moreover, a car navigation apparatus that can directly receive a command by voice in a command execution mode, without displaying an operating procedure, has also been provided as a method capable of entering a command more quickly by entering a conversational command.

[Patent document 1] Japanese Unexamined Patent Publication No. 2000-97719

However, when a command by voice (hereinafter referred to as "voice command") is entered to a car navigation apparatus, the car navigation apparatus cannot recognize the voice in some cases, depending on the user's vocal conditions (for example, the level of the voice uttered by the user). In these cases, the user cannot tell which of the following reasons prevents the car navigation apparatus from recognizing the voice, and hence is confused: the voice command was improperly issued; the vocal conditions were not good; or the apparatus did not operate because the switch for performing voice recognition was not pressed.

Therefore, it is an object of the present invention to provide a vehicle mounted control apparatus that can inform a user about the state of recognition of a voice command uttered by the user in such a way that the user can easily understand the state of recognition.

SUMMARY OF THE INVENTION

A vehicle mounted control apparatus of the present invention includes a voice recognition section for recognizing a voice command that a user inputs by means of an externally connected voice input device; and a control section that, when the voice command cannot be recognized by the voice recognition section, analyzes the cause of the failure to recognize the voice command and changes the display format of a screen of an externally connected display device on the basis of the results of the analysis.

With this arrangement, the control apparatus can give the user an opportunity to practice an operation when the user wants to learn an operating method, so that the user acquires the operation effectively. Moreover, through this practice for acquiring the operating method, the vehicle mounted control apparatus can also finish learning a parameter for recognizing the voice of the user. Hence, the rate of voice recognition improves and the work of the user decreases, because the user need not perform the additional work of inputting voice solely so that the vehicle mounted control apparatus can learn the voice of the user.
BRIEF DESCRIPTION OF THE DRAWINGS

Fig. 1 is a function block diagram showing a vehicle mounted control apparatus in accordance with an embodiment of the present invention.

Fig. 2 is a perspective view showing the driver's seat of a vehicle on which the vehicle mounted control apparatus in accordance with the embodiment of the present invention is mounted.

Fig. 3 is a flow chart showing the whole processing of the vehicle mounted control apparatus in accordance with the embodiment of the present invention.

Figs. 4A and 4B are illustrations showing the display content of a display device in the vehicle mounted control apparatus in accordance with the embodiment of the present invention. Fig. 4A illustrates the display content in which an icon for displaying the state of reception of voice is displayed, and Fig. 4B illustrates the display content that informs the user of the state of operation.

Fig. 5 is an illustration showing the display content of the display device in the vehicle mounted control apparatus of the embodiment of the present invention, and illustrates a change in the display mode of a screen.

Figs. 6A and 6B are illustrations showing the display content of a display device in the vehicle mounted control apparatus in accordance with the embodiment of the present invention. Fig. 6A illustrates the display content that provides the user with a menu capable of providing guidance on an operation, and Fig. 6B illustrates the display content that provides the user with the content of a selected operation.

Fig. 7 is an illustration showing the correspondence between analysis results and display formats, which is stored in the memory of the vehicle mounted control apparatus of the embodiment of the present invention.

Fig. 8 is an illustration showing the correspondence between the contents of voice commands and their example sentences, which is stored in the memory of the vehicle mounted control apparatus of the embodiment of the present invention.

BEST MODE FOR CARRYING OUT THE INVENTION 
In order to describe the present invention in more detail, the best mode for carrying out the present invention will be described hereafter with reference to the accompanying drawings.

Embodiment 1 

Fig. 1 shows a vehicle mounted control apparatus of this embodiment 1; a car navigation apparatus is shown here as one example of the apparatus. In Fig. 1, a main device 1 includes a control section 2 constructed of a microprocessor or the like, a memory 3 that is connected to the control section 2 and stores programs and various kinds of data, a map information storage section 4 that is also connected to the control section 2 and stores the digital data of maps, and a voice recognition section 5 (voice recognition section) that recognizes voice signals entered from a microphone 8 (voice input device). The control section 2 has a function of controlling a display device 6 connected to the main device 1 and displaying route information and road information necessary for navigation on the display device 6. A liquid crystal display device is generally used as the display device 6, but any type of display device can be used. The display device 6 may be integrated with the main device 1 or may be integrally embedded in a portion of the interior surface of a vehicle.

The control section 2 calculates the position of the apparatus itself by a calculation method using a publicly known technology from signals that are sent by a plurality of satellites and received by a GPS (Global Positioning System) antenna 7. The microphone 8 converts the user's voice into an electric signal and outputs the electric signal to the voice recognition section 5. The speaker 9 outputs voices, sound effects, and music under the control of the control section 2.

The interface 10 has a function of relaying, between the control section 2 of the apparatus and the electronic devices of the car, the operational status signals of these electronic devices and the control signals to them. Examples of these electronic devices include a control device of an air conditioner, head lights, and sensors for detecting the on-off states of a wiper and the head lights (none of which are shown).

The input device 11 is a device that detects a command from a user and outputs a signal corresponding to this command to the control section 2. Various types of input devices capable of converting a user's command into a signal can be used as this input device 11, for example, one or a plurality of buttons, a tablet, a touch sensor mounted on the display device 6, a joystick, and a lever mounted on a vehicle body.

A conversation starting switch 12 is a device that, when pressed by the user, outputs to the control section 2 a signal informing it that the switch 12 has been pressed.

Fig. 2 is a perspective view of the driver's seat of a vehicle to which a vehicle mounted control apparatus in accordance with this embodiment is applied. The microphone 8 is mounted in a position where the driver's voice can be easily input, for example, on a dashboard forward of the driver's seat, and the conversation starting switch 12 is mounted in a position where the user can easily press it while driving the vehicle without interfering with driving, for example, on a steering wheel.

This vehicle mounted control apparatus has two operation modes: a command execution mode and an operation guidance mode.

First, the command execution mode is a mode for performing an ordinary operation. In the command execution mode, when a user inputs voice to the apparatus, the voice recognition section 5 recognizes a voice command from the user and the control section 2 executes processing corresponding to the voice command. In this command execution mode, it is possible to control various kinds of vehicle mounted electronic devices, for example, to set a destination and start route guidance in navigation and to control the air volume of an air conditioner and an audio device, and also to control electronic devices provided outside the vehicle through the Internet, such as an electronic device for electronic mail and Internet ITS (Intelligent Transport Systems).

On the other hand, the operation guidance mode is a mode for giving the user a description of a method for performing voice input, so that the user can learn the operating procedure and voice commands of the vehicle mounted apparatus without looking at an operating manual. Switching between the command execution mode and the operation guidance mode can be performed, for example, by pressing a selector button or by selecting a menu displayed on the display device 6.

Fig. 3 is a flow chart of the processing of the control section 2, showing the command execution mode and the operation guidance mode. First, when the vehicle mounted apparatus is started, an initial screen at startup is displayed (step ST1). The control section 2 displays a message of "Please select a user" on the initial screen and displays a list of a plurality of users previously registered in the memory 3. The user looking at this display selects one user from the list. The control section 2 detects a signal output by the input device 11 and identifies the user on the basis of this signal (step ST2).

Next, the control section 2 detects an input signal from the input device 11 or the conversation starting switch 12 and checks whether this input signal selects the operation guidance mode or the processing of the command execution mode (step ST3). Here, when the user presses down the conversation starting switch 12, the control section 2 executes the processing of the command execution mode at steps ST5 to ST8. On the other hand, when the user selects the operation guidance mode, the control section 2 executes the processing of the operation guidance mode at steps ST10 and ST11.

First, the operation of the vehicle mounted apparatus in the command execution mode will be described.

The control section 2 waits for an input signal from the input device 11 and/or the input of a voice command from the microphone 8 (step ST4) and at the same time, as shown in Fig. 4A, displays on the screen a "voice reception state icon" 13 labeled "VOICE" to show the state of reception of a voice command.

When the user inputs voice to the microphone 8, the voice recognition section 5 recognizes the input voice. At this time, the voice recognition section 5 reads a recognition parameter for the user identified at step ST2 from the memory 3 and recognizes the voice by the use of this parameter (step ST5). For example, in a case where the user desires to retrieve a nearby gas station, when the user utters a voice command of "Near gas station", "Near gas station" is input from the microphone 8.

Next, the control section 2 identifies which of a plurality of voice commands has been uttered by the user. If the control section 2 can identify the voice command, the control section 2 determines that the state of voice of the voice command is good and executes processing corresponding to the identified voice command (step ST7).

On the other hand, when the control section 2 cannot identify the voice command, the control section 2 determines that the state of voice of the voice command is not good and analyzes the state of voice (step ST8). The analysis of the state of voice will be described in detail below. Further, the control section 2 changes the display on the screen of the display device 6 on the basis of the analysis results of the state of voice and again waits for a voice command to be input (step ST9).
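
The branch between steps ST7 and ST8 to ST9 can be sketched roughly as follows. This is only an illustrative sketch and not the implementation of the embodiment: the `Recognition` structure, the function name `handle_recognition`, and the two callbacks are hypothetical names introduced here for illustration.

```python
from dataclasses import dataclass
from typing import Callable, Optional


@dataclass
class Recognition:
    """Outcome of one recognition attempt (hypothetical structure)."""
    command: Optional[str]  # identified voice command, or None if not identified
    cause: Optional[str]    # analyzed state of voice when identification fails


def handle_recognition(result: Recognition,
                       execute: Callable[[str], None],
                       change_display: Callable[[str], None]) -> str:
    """Execute an identified command, or change the display and keep waiting."""
    if result.command is not None:
        execute(result.command)            # corresponds to step ST7
        return "executed"
    # The state of voice has been analyzed (step ST8); show it (step ST9).
    change_display(result.cause or "unknown")
    return "waiting"
```

In this sketch, `execute` stands for whatever processing the control section 2 performs for the identified command, and `change_display` stands for the change of the screen of the display device 6.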

Here, changing the display on the screen means displaying an icon corresponding to the state of voice, changing the color of an already displayed icon or of the whole screen in response to the state of voice, blinking an already displayed icon or the whole screen (in this case, the interval or the pattern of blinking is changed in response to the state of voice), displaying the state of voice as character information, or using these operations in combination. In this regard, any display method can be employed as long as the user can visually recognize the state of voice when looking at the screen of the display device 6.

Finally, the control section 2 detects, on the basis of an electric signal or the like from the input device 11 or from a power supply key via the interface 10, whether or not the user has performed an operation of finishing the processing of the vehicle mounted apparatus (step ST12). When the user has not performed the finishing operation, the control section 2 repeats the processing from step ST3. On the other hand, when the user has performed the finishing operation, the processing is finished. Here, "Near gas station" has been input by the user as the voice command, so the vehicle mounted apparatus recognizes the voice command, retrieves a nearby gas station, and displays the result of the retrieval. In this regard, as shown in Fig. 4B, "Near gas station is now being retrieved" is displayed during the operation in order to show the state of operation to the user.

Next, the operation of the vehicle mounted apparatus in the operation guidance mode will be described. For the sake of convenience in description, a case where a facility is retrieved so as to set a destination will be described as an example.

When the user selects the operation guidance mode, the control section 2 displays an initial screen on the display device 6 (step ST10) as shown in Fig. 6A. This initial screen provides the user with a menu of the operations on which the operation guidance mode can give guidance. In this case, the operation guidance mode includes "basic operation", "address retrieval", "facility retrieval", and "periphery retrieval".

Next, the control section 2 detects a command input to the input device 11 by the user, and when the command is "facility retrieval", the control section 2 displays an operation describing screen shown in Fig. 6B (step ST11). In this operation describing display, the control section 2 reads from the memory 3 a method of uttering a voice command relating to the selected command and displays it on the display device 6. Since "facility retrieval" is selected in this case, the control section 2 displays "Utter the name of the prefecture and the name of the facility consecutively" as the method of uttering the corresponding voice command.

Moreover, the control section 2 reads from the memory 3 an example sentence of a voice command relating to the currently selected command and displays the example sentence on the display device 6. Since "facility retrieval" is selected in this case, "XX Zoo, Tokyo", "YY Station, Kanagawa Prefecture" and "ZZ Land, Chiba Prefecture" are displayed as example sentences. Here, it is also recommended that the displayed sentence be output as voice from the speaker 9 so that the user can hear the sentence.

In this regard, it is also recommended that when the control section 2 detects "to look at demonstration" as a command input to the input device 11 by the user, as shown in Fig. 6B, the general operation including the operation of the input device 11 (for example, "to press the conversation starting switch" at the time of inputting voice) be described. That is, in this case, when the user utters "the name of prefecture" and "the name of facility", the user needs to press the conversation starting switch 12; hence, in order to let the user know of the need "to press the conversation starting switch at the time of inputting voice", this is visually displayed.

Next, the operation of analysis of the state of voice will be described.
Assume that the user issues a voice command while pressing the conversation starting switch 12. When the control section 2 detects that the conversation starting switch 12 is pressed down, the control section 2 records the voice received by the microphone 8 in the memory 3 for as long as it detects that the switch 12 is pressed down.

Next, when the control section 2 no longer detects that the conversation starting switch 12 is pressed down, the control section 2 finishes recording the voice in the memory 3 and instructs the voice recognition section 5 to analyze the recorded voice. The voice recognition section 5 analyzes the features of the voice according to a publicly known voice recognition algorithm and records the analysis results in the memory 3. For example, the voice recognition section 5 compares the standard pattern of voice stored in the memory 3 with the pattern of the input voice to analyze a plurality of features, for example, sound level, uttering speed, uttering timing, and whether or not unnecessary words (hereinafter referred to as "incidental words") are included, and outputs the analysis results.

Next, the control section 2 determines whether or not the input voice is good on the basis of the analysis results. Any parameter that shows the ease of voice recognition can be used to determine whether or not the input voice is good. For example, when any one of sound level, uttering speed, uttering timing, and the presence or absence of incidental words is detected to be "not good", the control section 2 determines that the total analysis result is "not good". On the other hand, when the analysis results for all items of analysis are "good", the control section 2 determines that the total analysis result is "good".
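
The rule described above, in which the total analysis result is "good" only when every item of analysis is "good", can be illustrated by the following minimal sketch. The function name and the use of boolean values per item are assumptions made for illustration.

```python
from typing import Mapping


def total_analysis_result(item_results: Mapping[str, bool]) -> str:
    """Return "good" only when every analyzed item is good (True)."""
    return "good" if all(item_results.values()) else "not good"
```

For example, a mapping in which only the sound level is judged not good yields a total analysis result of "not good", whatever the other items are.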

Here, when the control section 2 determines that the analysis result is "good", as described above, the control section 2 executes the recognized voice command. On the other hand, when the control section 2 determines that the analysis result is "not good", as described above, the control section 2 changes the display of the screen on the basis of the display format associated with the analysis result. The analysis results correspond to the display formats in the manner shown in Fig. 7, and this correspondence is stored in the memory 3. For example, when the control section 2 recognizes that the analysis result is "not good" because "sound level is high", the control section 2 refers to a state-of-voice correspondence table stored in the memory 3 and recognizes that the color corresponding to the case of "sound level is high" is "blue". Further, the control section 2 changes the color of an icon displayed on the screen, for example, the color of the above described icon 13 of "VOICE" showing the state of reception of voice, to the state shown in Fig. 5 and again waits for an input of a command by voice. The icon is not limited to the icon 13 of "VOICE" showing the state of reception of voice but may be an icon showing the present position. Further, the whole screen may be changed to blue.
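
The lookup in the state-of-voice correspondence table can be sketched as a simple mapping. Only the two entries whose colors are stated in this description are included; the remaining entries of Fig. 7 are not reproduced here, and the names `STATE_OF_VOICE_TABLE` and `display_color` are hypothetical.

```python
from typing import Optional

# Entries taken from this description only; other entries of Fig. 7 are omitted.
STATE_OF_VOICE_TABLE = {
    "sound level is high": "blue",
    "sound level is low": "greenish sky blue",
}


def display_color(analysis_result: str) -> Optional[str]:
    """Look up the display color for an analysis result; None if not listed."""
    return STATE_OF_VOICE_TABLE.get(analysis_result)
```

Returning `None` for an unlisted result is a choice made for this sketch; an actual apparatus would hold an entry for every analysis result it can output.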

Here, in the analysis results, "sound level is high" and "sound level is low" are the analysis results from the viewpoint of "sound level", "conversation speed is high" and "conversation speed is low" are the analysis results from the viewpoint of "conversation speed", and "conversation timing is early" and "conversation timing is late" are the analysis results from the viewpoint of "conversation timing". Hence, colors of the same group may be used for the analysis results from the same viewpoint. That is, as described above, "sound level is high" and "sound level is low" are the analysis results from the viewpoint of "sound level", so they are displayed in colors of the same group; for example, "sound level is high" is displayed in "blue" and "sound level is low" is displayed in "greenish sky blue". Further, when a level is higher than a standard, for example, "high" and "early", a dark color is used, and when a level is lower than a standard, for example, "low" and "late", a light color is used. With this, the user understands what is wrong from the color group and can get a sense of "higher than a standard" or "lower than a standard" from the shade of the color.

Next, the analysis of voice will be described in detail. For example, whether the input voice is good or not is determined by the following method.

(1) Presence or absence of incidental word at the beginning of sentence

The voice recognition section 5 compares the standard pattern of an incidental word stored in the memory 3 with the input voice and produces an output to the effect that the analysis result is "not good" when the beginning of the voice is in good agreement with the standard pattern of the incidental word. On the other hand, when the voice does not have any incidental word at its beginning, the voice recognition section 5 produces an output to the effect that the analysis result is "good".

(2) Sound level

The voice recognition section 5 detects whether or not the sound level of the input voice is within a predetermined range and produces an output to the effect that the analysis result is "not good" when the sound level is not within the predetermined range. On the other hand, when the sound level is within the predetermined range, the voice recognition section 5 produces an output to the effect that the analysis result is "good". The predetermined range is set at a range from -30 dB to less than 0 dB, where the maximum value of sound level detectable by the microphone 8 is 0 dB.
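
The range check on the sound level can be sketched as follows. This is a minimal sketch under stated assumptions: the level is assumed to be already expressed in dB relative to the maximum detectable level, the lower bound of -30 dB is assumed to be inclusive, and the function name is hypothetical. The result labels are those used for the analysis results in this description.

```python
def analyze_sound_level(level_db: float,
                        lower_db: float = -30.0,
                        upper_db: float = 0.0) -> str:
    """Classify a sound level given in dB relative to the microphone maximum."""
    if level_db >= upper_db:
        return "sound level is high"   # at or above the maximum (0 dB)
    if level_db < lower_db:
        return "sound level is low"    # below the assumed inclusive lower bound
    return "good"
```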

(3) Length of conversation time

The voice recognition section 5 measures the length of time of the input voice, compares this length of time with the length of time of the standard pattern, and determines that this length of time is good when the difference between them is within a predetermined range.

This predetermined range can be set arbitrarily; for example, the difference in the length of time of the input voice can be set at a range of +25 % to -25 % of the length of time of the standard pattern.
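
As an illustration of the example range of +25 % to -25 %, the comparison can be sketched as below. The function name, the use of seconds as the unit, and the treatment of the boundary values as good are assumptions of this sketch.

```python
def is_duration_good(input_s: float, standard_s: float,
                     tolerance: float = 0.25) -> bool:
    """True when the input length deviates from the standard by at most tolerance."""
    if standard_s <= 0:
        raise ValueError("standard_s must be positive")
    relative_difference = (input_s - standard_s) / standard_s
    return abs(relative_difference) <= tolerance
```

For a standard pattern of 1.0 second, an input of 1.2 seconds is within the range, while an input of 1.3 seconds is not.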

(4) Uttering timing

The voice recognition section 5 determines whether or not the uttering timing of the input voice is within a predetermined range. The voice recognition section 5 produces an output to the effect that the analysis result is "good" when the uttering timing is within the predetermined range. On the other hand, when the uttering timing is not within the predetermined range, the voice recognition section 5 produces an output to the effect that the analysis result is "not good". For example, when voice input has begun at the very beginning of the period during which the microphone 8 can acquire the voice, the voice recognition section 5 produces an output to the effect that the analysis result is "not good". Likewise, in cases where the utterance has not yet been completed when the maximum input time of a voice command has elapsed, or where a voice input is detected only at the end of the above described period during which the microphone 8 can acquire the voice, the voice recognition section 5 produces an output to the effect that the analysis result is "not good". In all other cases, the voice recognition section 5 produces an output to the effect that the analysis result is "good".
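
The timing cases above can be sketched as follows. The description does not state how close to the start or end of the period counts as "the beginning" or "the end", so the `margin_s` parameter is purely a hypothetical value, as are the function name and the convention that `speech_end_s` is `None` when the utterance has not finished within the period.

```python
from typing import Optional


def analyze_timing(speech_start_s: float, speech_end_s: Optional[float],
                   window_s: float, margin_s: float = 0.1) -> str:
    """Classify uttering timing; times are seconds from the start of the period."""
    if speech_start_s <= margin_s:
        return "conversation timing is early"   # voice began at the very start
    if (speech_end_s is None                     # utterance not completed in time
            or speech_end_s > window_s
            or speech_start_s >= window_s - margin_s):  # voice only at the end
        return "conversation timing is late"
    return "good"
```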

(5) Presence or absence of incidental word at the end of sentence

As is the case with the detection of an incidental word at the beginning of a sentence, the voice recognition section 5 detects whether or not there is an incidental word at the end of a sentence. When the voice recognition section 5 detects an incidental word, the voice recognition section 5 produces an output to the effect that the analysis result is "not good". On the other hand, when the voice recognition section 5 does not detect an incidental word at the end of the sentence, the voice recognition section 5 produces an output to the effect that the analysis result is "good".
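
The checks of items (1) and (5) can be illustrated in a simplified form. Note that the embodiment compares the input voice with stored standard patterns of incidental words, whereas this sketch substitutes a comparison on already recognized words; the word list and the function name are hypothetical examples, not taken from the description.

```python
from typing import Dict, FrozenSet, Sequence

# Hypothetical examples of incidental words; the description lists none.
INCIDENTAL_WORDS: FrozenSet[str] = frozenset({"um", "uh", "well"})


def analyze_incidental_words(
        words: Sequence[str],
        incidental: FrozenSet[str] = INCIDENTAL_WORDS) -> Dict[str, str]:
    """Check the first and last recognized words against the incidental words."""
    leading = bool(words) and words[0].lower() in incidental
    trailing = bool(words) and words[-1].lower() in incidental
    return {
        "beginning": "not good" if leading else "good",
        "end": "not good" if trailing else "good",
    }
```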

Next, a set of example sentences will be described in detail. Fig. 8 is a list in which the contents of voice commands are correlated with example sentences that the user utters so as to execute the respective voice commands.

The sentences for the contents (a) to (e) of voice commands shown in this drawing include a plurality of commands of different diction, and the voice recognition section 5 can recognize the voice commands on the basis of a plurality of utterances of different diction.

In this regard, while the above description has been given with particular emphasis on the car navigation apparatus, the vehicle mounted control apparatus is not limited to this example, and the present invention can be applied to any type of vehicle mounted control apparatus for operating electronic devices inside and outside of the vehicle. For instance, the vehicle mounted control apparatus includes a control apparatus for controlling vehicle mounted electronic devices, for example, an air conditioner, an audio unit and the like. Moreover, a conceivable control apparatus for controlling electronic devices outside of the vehicle is, for example, a control apparatus that controls various kinds of electronic devices outside of the vehicle through a transmitter connected to the interface 10. The electronic devices outside of the vehicle include all kinds of electronic devices that are connected by communications lines, for example, an air conditioner intended for home use or commercial use, a home security system, a home server, other electric appliances, an automated paying machine installed at a store such as a fast food store or a gas station, and a gate installed at the entrance and exit of a parking lot.

Further, while the control apparatus in Fig. 1 has been shown with the voice recognition section 5 constructed of an LSI for voice recognition, the voice recognition section 5 is not limited to a hardware circuit designed exclusively for voice recognition but may employ a software program for voice recognition. In this case, the program for voice recognition is executed by the control section 2 or by a separate processor used exclusively for voice recognition.

Still further, while the above embodiment gives an example of the voice recognition processing in which the voice recognition section 5 performs voice analysis as preliminary processing and the control section 2 executes the processing of displaying advice by use of the result of the analysis, the control section 2 can also execute the voice analysis.
Still further, while the recognition parameters of a plurality of users are registered in the above described embodiment, there are also cases where only one specified user uses the control apparatus, so it is not always necessary to classify the users. In a case where only one specified user uses the control apparatus, the processing of registering users at step ST2 in Fig. 3 is not required.

As for the memory 3, a volatile and/or nonvolatile memory can be used. Moreover, a storage device such as a hard disk or a DVD-RAM can be used as the storage device.

Industrial Applicability 

As described above, a vehicle mounted control apparatus in accordance with the present invention is applicable to a car navigation apparatus.



